Special-purpose hardware accelerators are increasingly pivotal for sustaining performance improvements in emerging applications, especially as the benefits of technology scaling continue to diminish. However, designers currently lack effective tools and methodologies to construct complex, high-performance accelerator architectures in a productive manner. Existing high-level synthesis (HLS) tools often require intrusive source-level changes to attain satisfactory quality of results. Despite the introduction of several new accelerator design languages (ADLs) aiming to enhance or replace HLS, their advantages are most evident in relatively simple applications with a single kernel; they prove less effective for realistic hierarchical designs with multiple kernels, even when the design hierarchy is flattened. In this paper, we introduce Allo, a composable programming model for efficient spatial accelerator design. Allo decouples hardware customizations, including compute, memory, communication, and data types, from the algorithm specification, and encapsulates them as a set of customization primitives. Allo preserves the hierarchical structure of an input program by combining customizations from different functions in a bottom-up, type-safe manner. This approach facilitates holistic optimizations that span function boundaries. We conduct comprehensive experiments on commonly used HLS benchmarks and several realistic deep learning models. Our evaluation shows that Allo outperforms state-of-the-art HLS tools and ADLs on all test cases in PolyBench. For the GPT2 model, the Allo-generated accelerator achieves 1.7x lower inference latency than an NVIDIA A100 GPU, with 5.4x higher energy efficiency, demonstrating the capability of Allo to handle large-scale designs.
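To make the decoupled style concrete, the following is a minimal sketch in the spirit of Allo's Python-embedded interface. The primitive names used here (allo.customize, allo.grid, reorder, pipeline, compose, and build with a Vivado/Vitis HLS target) follow the examples in the paper and Allo's open-source release, but should be read as assumptions about the API rather than a verified listing.

    import allo
    from allo.ir.types import float32

    M, N, K = 32, 32, 32

    # Algorithm specification only: a plain GEMM kernel, with no
    # pragmas or hardware directives mixed into the source.
    def gemm(A: float32[M, K], B: float32[K, N]) -> float32[M, N]:
        C: float32[M, N] = 0.0
        for i, j, k in allo.grid(M, N, K):
            C[i, j] += A[i, k] * B[k, j]
        return C

    # Hardware customizations live outside the kernel, as schedule
    # primitives applied to a customization handle.
    s = allo.customize(gemm)
    s.reorder("k", "j")            # compute customization: swap j and k, making j innermost
    s.pipeline("j")                # pipeline the (now innermost) j loop
    code = s.build(target="vhls")  # emit HLS C++ for Vivado/Vitis

    # For a hierarchical, multi-kernel design, a caller's schedule can
    # absorb callee schedules bottom-up in a type-checked composition:
    #   s_top = allo.customize(top)  # hypothetical top() that calls gemm()
    #   s_top.compose(s)

The point of the sketch is that gemm itself never changes: retargeting or re-tuning the design only touches the schedule s, and compose is what lets per-function customizations stack across function boundaries in the bottom-up manner the abstract describes.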
Finding node correspondence across networks, namely multi-network alignment, is an essential prerequisite for joint learning on multiple networks. Despite great success in aligning networks in pairs, the literature on multi-network alignment is sparse, owing to the exponentially growing solution space and the lack of high-order discrepancy measures. To fill this gap, we propose a hierarchical multi-marginal optimal transport framework named HOT for multi-network alignment. To handle the large solution space, multiple networks are decomposed into smaller aligned clusters via the fused Gromov-Wasserstein (FGW) barycenter. To depict high-order relationships across multiple networks, the FGW distance is generalized to the multi-marginal setting, based on which networks can be aligned jointly. A fast proximal point method is further developed with guaranteed convergence to a local optimum. Extensive experiments and analysis show that our proposed HOT achieves significant improvements over the state of the art in both effectiveness and scalability.
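For background on the building block the abstract names, the pairwise fused Gromov-Wasserstein distance between two attributed networks couples a feature-matching term with a structure-matching term; the formulation below is the standard one from the optimal transport literature, not text from the paper:

    % Pairwise FGW between two networks with node features x_i, y_j,
    % intra-network structure matrices C_1, C_2, node distributions
    % \mu, \nu, and a feature/structure trade-off \alpha \in [0, 1].
    \mathrm{FGW}_{\alpha} =
      \min_{\pi \in \Pi(\mu, \nu)} \sum_{i,j,k,l}
        \Big[ (1 - \alpha)\, d(x_i, y_j)
              + \alpha\, \big| C_1(i,k) - C_2(j,l) \big| \Big]\, \pi_{ij}\, \pi_{kl}

The first term is the Wasserstein (feature) part and the second the Gromov-Wasserstein (structure) part, both weighted by the coupling \pi. Read against this definition, the paper's two moves are: computing an FGW barycenter to decompose the networks into smaller aligned clusters, and replacing the single pairwise coupling \pi with a joint multi-marginal coupling over all networks, so that alignment quality is scored across every network at once rather than pair by pair.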